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BACKGROUND OF THE INVENTION 

Field of the Invention 

[0001 ] The present invention relates generally to analyzing an enzyme solution. 

More particularly, the invention relates to facilitating an activity determination 
for gel-based/chromatographic-based endpoint or dose response analysis. 

Related Art 

[0002] Electrophoresis is a technique used to separate and analyze single charged 

molecules. The charged molecules can be placed on any type of support matrix, 
such as paper, cellulose acetate, starch gel, agarose gel, or acrylamide gel. 
Generally, a buffer is run in a separation medium containing the support matrix, 
and an electric field is applied to the support matrix. At the end of the run, the 
support matrix is stained appropriately for visualization of the molecules within 
the matrix. 

[0003] Since agarose and acrylamide gels are porous substances, an agarose or 

acrylamide gel-based separation medium permits the molecules to be separated 
by size or molecular weights. The gels, therefore, retard or prevent larger 
molecules from moving, and allow the smaller molecules to migrate freely. 
Agarose gels are generally used to separate larger macromolecules, such as, 
nucleic acids, large proteins and protein complexes, because agarose gels tend to 
be more rigid and easy to handle. Acrylamide gels, on the other hand, are more 
commonly used to separate medium or smaller-sized proteins and small 
oligonucleotides requiring a smaller gel pore size for retardation. 
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[0004] Nucleic acids, including deoxyribonucleic acid (DNA) and ribonucleic 

acid (RNA), tend to carry a negative charge in any buffer used for electrophoresis. 
As such, nucleic acids have a propensity to separate according to their molecular 
weight. The electrophoretic separation of a protein, however, is based on its 
electrical charge and molecular weight. Since proteins are amphoteric 
compounds, their electrical charge depends on the pH of the buffer used for 
electrophoresis. If the pH exceeds the protein's isoelectric point, a negative 
charged protein would migrate towards the anode in the electrical field. If the pH 
is below the isoelectric point, the protein is likely to have a positive charge 
causing it to migrate towards the cathode. 

[0005] A restriction endonuclease (i.e., restriction enzyme) is added to cleave a 

nucleic acid (e.g., DNA or RNA) at certain sites along the macromolecule. 
Similarly, a proteinase or protease (i.e., proteolytic enzyme) is used to break 
protein chains into shorter peptides or break the peptides into amino acids. 

[0006] With respect to DNA, a restriction enzyme has the ability to recognize a 

short, specific sequence of nucleotide bases (such as, adenine, cytosine, thymine, 
and guanine) and severe the DNA molecule at these recognition sites by 
catalyzing the hydrolysis of the bond between adjacent nucleotides. Although 
some types of restriction enzymes are known to cleave DNA at specific sites 
within the recognition site; other types of restriction enzymes cleave DNA 
randomly, sometimes hundreds of bases from the recognition sequence. 

[0007] A restriction enzyme's ability to cut DNA at precise locations is germaine 

to a researcher's ability to isolate gene fragments and recombine them with other 
DNA molecules. Understandably, precise manipulation of DNA fragments is 
crucial to recombinant DNA technology or genetic engineering. 

[0008] It is also important to be able to accurately measure or determine the unit 

activity of a restriction enzyme. The unit activity (also referred to as the "unit 
call") is the least concentrated dilution of restriction enzyme (specifically, 
proteolytic enzyme for proteins) that results in a complete digestion of the 
macromolecule (i.e., nucleic acid or protein). 



[0009] Conventionally, one may determine the unit activity by visually detecting 

when, for example, a DNA fragment would disappear into the background of a 
support matrix. Such subjective calls are prone to human error and inherent 
inaccuracies. As a result, the unit activity could be misjudged by a significant 
factor. Subjective quantitation of enzymes makes it difficult to produce consistent 
products and control production costs. 

[0010] Another problem is related to the separation medium used to analyze 

DNA fragments. The separation medium may have multiple wells or lanes for 
apportioning the DNA samples throughout the gel. Each lane represents the result 
of one reaction. Adjacent lanes can be related to each other such that, as one 
travels from left to right, each lane represents the result of decreasing 
concentration of the enzyme used in each reaction to generate the visualized 
banding pattern seen in each lane of the gel. As the DNA fragments separate 
during electrophoresis, various factors (such as pH levels) can interfere with the 
flow in each lane. These factors can prevent each lane from running equivalently. 
In other words, the fragments do not migrate equivalent distances throughout the 
gel. This results in a wavy pattern (also known as a smile effect) that makes it 
difficult to align the DNA fragments across lanes and interpret the electrophoretic 
results. 

[001 1] Thus, there is a need in the art for a method and device that can accurately 

and obj ectively determine the unit activity or other catalytic results of a restriction 
enzyme. 



SUMMARY OF THE INVENTION 



[0012] The present invention is directed to a method and system for processing 

fragment population information that is generated from a stained macromolecule 
situated in a separation medium to objectively and quantitatively determine 
catalytic results (such as, the unit activity) of an enzyme. The term "enzyme," as 



used herein, is intended to include restriction enzymes, proteolytic enzymes or the 
like. 

[0013] A test aliquot, comprising amacromolecule (such as, DNA, RNA, protein, 

peptide or the like) and diluted enzyme concentration, is distributed in the 
separation medium containing a plurality of wells. The enzyme concentration acts 
as a catalysis to cleave the macromolecule into distinct fragments prior to the 
macromolecule being distributed in the separation medium. Each well within the 
separation medium produces a distinct lane of electrophoretic results. Adjacent 
wells, and hence lanes, have relationship with one another. From left to right, 
each lane represents the result of decreasing concentration of the enzyme used in 
each reaction to generate a visualized banding pattern seen in each lane. The lane- 
to-lane dilution difference in enzyme concentration is the same from one lane to 
the next. 

[0014] In an embodiment of the present invention, an intensity data profile is 

produced from digital images of the fragments to produce a series of stacked 
profiles. The stacked profiles are used to provide a model of the fragments 
resolved from the electrophoretic separation and residing in lanes below the 
reaction wells of the separation medium. 

[0015] The stacked profiles are vertically aligned to designate and assign each 

fragment within its respective lane aligned to the identical fragment in each 
adjacent lane. A group of partial band(s) and final band(s) are selected from the 
fragments. Peak integrations are implemented to measure the intensity of the 
partial and final bands. A series of intensity ratios are computed from the peak 
integrations. The ratios embody the intensity of the specified partial band relative 
to the intensity of the specified final band. 

[0016] After the intensity ratios have been computed and normalized, the 

intensity ratios are used to produce a trend approximation. A threshold crossing 
value is assigned a value at which the trend approximation crosses below a 
threshold crossing level. This unique intersection is characteristic of the enzyme 
strength of the original test sample. 



[0017] A product-specific calibration factor is calculated by dividing the 

threshold crossing value by a historical unit value. The historical unit value is the 
industry-specified amount of restriction enzyme required to obtain complete 
digestion of, for example, a DNA substrate under specified assay conditions. 
Once the calibration factor has been calculated, the threshold crossing value is 
divided by the calibration factor to determine the reported unit call. 

[001 8] In an embodiment, a technique is provided to highlight certain data if their 

removal would change the results by more than one-half of a lane interval, or if 
data points are unusually far from the fitted model. The operator can review the 
highlighted points, exclude the data from analysis, and refit the model. 



BRIEF DESCRIPTION OF THE FIGURES 



[0019] The accompanying drawings, which are incorporated herein and form part 

of the specification, illustrate the present invention and, together with the 
description, further serve to explain the principles of the invention and to enable 
a person skilled in the pertinent art to make and use the invention. In the 
drawings, like reference numbers indicate identical or functionally similar 
elements. Additionally, the leftmost digit(s) of a reference number identifies the 
drawing in which the reference number first appears. 

[0020] FIG. 1 illustrates a high-level block diagram of an embodiment of a 

electrophoretic analysis system. 

[0021] FIG. 2 illustrates on operational flow diagram for the steps involved in 

measuring spectral signatures of an electrophoretic sample according to an 
embodiment of the present invention. 

[0022] FIG. 3 illustrates a dilution matrix according to an embodiment of the 

present invention. 

[0023] FIG. 4 illustrates reaction and control wells from the separation medium 

shown in FIG. 1, according to an embodiment of the present invention. 



[0024] FIG. 5 illustrates an operational flow diagram for the steps involved in 

determining unit activity according to an embodiment of the present invention. 

[0025] FIG. 6 illustrates a stacked profile display according to an embodiment of 

the present invention. 

[0026] FIG. 7 illustrates a partial profile display according to an embodiment of 

the present invention. 

[0027] FIG. 8 illustrates is a block diagram of an example computer system 

useful for implementing the present invention 
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I. System Overview 



[0028] FIG. 1 illustrates, according to an embodiment of the present invention, 

system 100 for rendering objective, quantitative measurements of endonuclease 
activity. System 100 includes a separation medium 102, image capturing device 
108, processor 110 and display 1 12. 

[0029] Separation medium 102 utilizes electrophoretic techniques or 

methodologies to separate macromolecules in hydrated gels of acrylamide, 
agarose or the like. In an embodiment, separation medium 1 02 is, for example, 



of the type described in commonly assigned U.S. Patent No. 5,888,364, issued 
March 30, 1999, in the name of Michael W. Schuette, and entitled "Gel 
Electrophoresis Apparatus" (hereinafter referred to as "the '364 patent"). The 
disclosure of the '364 patent is incorporated herein by reference as though set 
forth in its entirety. However, the present invention should not be interpreted as 
being limited to the configuration of the '364 patent. Separation medium 1 02 can 
be any type of device used for electrophoresis, as would be apparent to a person 
skilled in the relevant art(s). In an embodiment, separation medium 102 includes 
Horizon™ 10-14 gel boxes available from Invitrogen Corporation (formerly, Life 
Technologies Inc.), and miniature external recirculation pumps (e.g., available 
from Cole-Parmer Instrument Company).The pumps produce a gentle flow of 
electrophoresis buffer over the top of the agarose gel bed. This flow prevents 
localized distortions in the sieving characteristics of the agarose gel bed through 
promotion of even resistive heat dissipation. The pumps ideally deliver a preset, 
even volumetric flow rate between 90 to 140 milliliters (ml) per minute. Higher 
flow rates tend to dislodge the agarose gel bed from the casting tray and interrupt 
electrophoresis operations. 
[0030] Referring to FIG. 1, separation medium 102 includes a plurality of 

reaction wells 104a-104p. The area directly below each reaction well 104a-104p 
represents a lane. Samples of DNA, RNA, proteins, peptides or like 
macromolecules are distributed into wells 1 04a- 1 04p where the samples separate 
upon application of an electric current through the gel according to their 
molecular weights. The results of the electrophoretic separations are shown as 
bands 106. Thus, the samples would travel down the respective lane below each 
reaction well 104a-104p to produce one or more bands 106 at the completion of 
the electrophoresis. 

[0031] Image Capturing Device 108 is any type of imaging system used to 

measure the contrast of bands 106. The image contrast is measured as the 
difference in the optical properties, such as absorption, emission, or scattering 
characteristics, between the subject to be imaged (i.e., band 106) and its 
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surroundings or background (i.e., the gel ormechanical structure supporting band 
106). The contrast (i.e., spectral signature or intensity) of the imaged subject is 
measured to create an intensity profile. In an embodiment, image capturing device 
108 measures luminescence (including fluorescence, phosphorescence, 
chemi luminescence, or the like) from the image. For example, the macromolecule 
samples are treated with a flourescent material that emits electromagnetic 
radiation towards image capturing device 108. 

[0032] hi an embodiment, image capturing device 108 is a Digital Science® 

Image Station 440CF using KDS1.0D software, available from Eastman Kodak 
Corporation, and includes a customized, optical quality, ground, colored-glass 
filter element. Images are collected with a Peltier-Cooled, Charge-Coupled 
camera that has at least a sixty micron individual pixel resolution. The resolution 
should include enough pixels to capture the image of interest, and a pixel 
intensity-resolution of minimally 4096 gray scale levels. In an embodiment, the 
filter element is the Chroma Band Pass 505-560 nanometers (ran) anti-reflective 
coated doublet filter or equivalent singlet (495-560 ran) available from Chroma 
Inc. The optical filter can be designated as either Chroma 495 to 550 or Chroma 
505 to 560. In both designs, Russian Blue speciality glass is used in combination 
with a Shott yellow glass, where the difference in performance between the two 
filters is related to the relative thickness of the two glasses in combination. 

[0033] In another embodiment, a dicroic filter is used. However, high quality, 

large obj ective multi-layer reflective dielectric type filters are typically expensive, 
difficult to source at or above a 50 millimeter (mm) objective size, and have 
incidence-angle dependencies that tend to reject fluorescence signals from the 
edge of an image. While in the preferred embodiment, image capturing device 
108 includes an optical filter element, the present invention can be implemented 
without a custom optical filter element. 

[0034] Processor 1 10 receives and analyzes the intensity profiles from image 

capturing device 1 08. Processor 1 10 represents one or more computers providing 
various shared resources with each other and to the other components of system 



100. The shared resources include files for programs, web pages, databases and 
libraries; output devices, such as, printers and plotters; and communications 
devices, such as modems and Internet access facilities. The communications 
devices can support wired or wireless communications, including satellite, 
terrestrial, radio, microwave or any other form or method of transmission. In an 
embodiment, processor 110 is configured to support the standard Internet 
Protocol (IP) developed to govern communications over public and private 
Internet backbones. The protocol is defined in Internet Standard (STD) 5, Request 
for Comments (RFC) 791 (Internet Architecture Board). Processor 110 can also 
support transport protocols, such as, Transmission Control Protocol (TCP), User 
Datagram Protocol (UDP) or Real Time Transport Protocol (RTP). Processor 1 1 0 
is also configured to support various operating systems, such as, the Netware™ 
system available from Novell®; the MS-DOS® system, the Windows NT® 
system or the Windows® 3 .xx/95/98/2000 system available from Microsoft®; the 
Linux® system available from Linux Online Inc.; the Solaris™ system available 
from Sun Microsystems, Inc.; or the like as would be apparent to one skilled in 
the relevant art(s). 

[0035] Display 1 1 2 is a monitor or other display device that communicates with 

processor 1 10. In an embodiment, display 112 provides a text or graphical user 
interface (GUI) that enables a user to interactively communicate with processor 
110. Display 112 can be coupled to a dumb terminal or a personal computer 
configured to query or respond to processor 1 1 0. In an embodiment, display 112 
includes an input device that supports communications with processor 110. The 
input device can be a keyboard, mouse, mouse wheel, joystick, rudder pedals, 
touch screen, microphone, joystick, stylus, light pen, or any other type of 
peripheral unit. The aforementioned is a representative list of input devices that 
can be used with the present invention, it should be understood that any other 
type of input device, as would be apparent to one skilled in the relevant art(s), 
could be easily included and would not change the scope of the invention. Any 
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presently available or future developed device that is responsive to a general 
purpose interface is encompassed by the present invention. 

II. Assay Setup 

[0036] Referring to FIG. 2, flowchart 200 represents the general operational flow 

of an embodiment of the present invention. More specifically, flowchart 200 
shows an example of a control flow for producing and analyzing an image of 
DNA fragments, or any other combination of enzymes and substrates that can be 
resolved into intermediate populations and final populations. As discussed, the 
methods and systems of the present invention can also be used to evaluate the 
cleavage or formation (i.e., Gateway) of RNA, protein, peptides or other 
macromolecules. 

[0037] The control flow of flowchart 200 begins at step 201 and passes 

immediately to step 204. At step 204, the operator prepares a control sample to 
demonstrate normal or expected endonuclease activity likely to have occurred 
during the course of an assay. In an embodiment, three control samples are 
prepared. Early, mid and late stage consistency samples are used as the control 
samples to provide early, mid and late-stage digestion patterns. A fourth sample 
of un-cut DNA substrate is also prepared. An examination of these control 
samples is used to detect over or under-digestion due to reagent or incubation 
variations. This can be accomplished by comparing the measured results from the 
control samples with the prescribed boundaries set for the restriction enzyme that 
is being assayed. 

[0038] At step 208, test samples of a substrate mix (in this embodiment, DNA 

substrate mix) and enzyme concentration are prepared by an operator. In an 
embodiment, single-use aliquots of DNA substrate are combined with a lOx 
reaction buffer that is specified for the enzyme that is being assayed. For some 
enzymes (such as, EcoR II, Nde II, Rsr II or the like), a final concentration of 
dithiothreitol (DTT) is required to support the reaction. Hence, the DNA substrate 



mix would include the requisite volumes of DNA, lOx reaction buffer, DTT (if 
required) and distilled water. 
[0039] The preparation requirements for the enzyme concentration depend on the 

type of assay selected. In an embodiment, two types of assays can be used to 
analyze and measure endonuclease activity. A wide range assay is a scouting 
assay initially used to estimate unit activity. If, on the other hand, the operator is 
highly confident in the unit activity estimate, a narrow range assay can be 
selected. 

[0040] Once the assay type has been selected, an operator prepares a storage 

buffer and transfers the storage buffer to dilution matrix tubes, as would be 
apparent to one skilled in the relevant art(s). FIG. 3 illustrates a dilution matrix 
according to an embodiment of the present invention. FIG. 3 shows a four-by- 
four matrix that represents sixteen tubes used to prepare a series of test samples. 
Four tubes are designated as pre-dilution tubes 301-304, and twelve tubes are 
designated as dilution tubes 305-316. It should be noted, however, that a wide 
range assay of sample concentrations could have twelve pre-dilution tubes. 

[0041] First, an operator calculates the volumes of storage buffer that is added to 

pre-dilution rubes 301-304. This is accomplished by determining atotal fold (TF), 
which is defined as the magnitude increment dilution applied to samples across 
the pre-dilution series. In an embodiment, the total fold is determined by 
referencing Table 1 for a wide range assay or Table 2 for a narrow range assay. 



Table 1 : Wide Range Total Fold 



Estimated Activity 


TF 


Pre-Dilution Series 


2.5 to 25 units per ul 


1 


no pre-dilution series 


25 to 250 units per ul 


10 


10 ul into 90 ul S.B. 


250 to 2,500 units per ul 


100 


10 ul into 990 ul S.B. 


2,500 to 25,000 units per ul 


1,000 


10 ul into 90 ul, then 
10 ul into 990 ul S.B. 
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Table 2: Narrow Range Total Fold 



Estimated Activity 


TF 


Pre-Dilution Series 


4.7 to 47 units per ul 


1 


no pre-dilution series 


47 to 470 units per ul 


10 


10 ul into 90 ulS.B. 


470 to 4,700 units per ul 


100 


10 ul into 990 ul S.B. 


4,700 to 47,000 units per ul 


1,000 


10 ul into 90 ul, then 
10 ul into 990 ul S.B. 



After the total fold has been determined, a dilution target (DT) is also 
determined to serve as an estimate of endonuclease activity. In other words, the 
operator determines a dilution target to provide an estimated guess of the dilution 
factor or unit activity. In an embodiment, the dilution target is determined as: 

Dilution Target = [EA x (TV - SQ)] + AM 

where EA represents the estimated activity in units per microliter (ul); TV 
represents the transfer volume out of the dilution matrix shown in FIG. 3; SQ is 
the substrate DNA quantity per reaction well 104a-104p; and AM is the assay 
midpoint. The assay midpoint depends on the assay type that has been selected. 
For a wide range assay, the assay midpoint is determined by 1 .585 (CeilterLane " 1) . For 
example, if lane six is the center lane, the assay midpoint would be 10.00. If, 
however, a narrow range assay is chosen, the assay midpoint is determined by 
1 Y ^(CenterLane- 1) p Qr exam pi e; if lane six is the center lane, then the assay midpoint 
would be 2.38. The parameters "1.585" and "1.19" are lane-to-lane dilution 
factors that corresponds to the type of assay selected (i.e., wide or narrow range 
assay). 

For example, for a test sample using a wide range assay and having an 
estimated activity of approximately 220 units per ul, a transfer volume of 5.0 ul, 
an assay midpoint of 10.0 and 2.5 micrograms (ug) DNA per reaction, the 
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dilution target would be calculated as 44 fold dilution [i.e., "[220 units per ul x 
(5 ul - 2.5 fig DNA)] - \.5S5 (6 - l) fold"]- 
[0044] After the total fold and dilution target have been determined, Tables 3-4 

are used to calculate the volume per pre-dilution tube 301-304. Specifically, 
Table 3 provides the equations for calculating the volumes for each pre-dilution 
tube 301-304 for a wide range assay. Table 4 provides the equations for such 
calculations with respect to a narrow range assay. 



Table 3: Wide Range Volume Calculation 



Pre-Dilution 
Tube 


Volume 


301 


{[(DT- 


- TF)x 1.585°] -1} x30ul 


302 


{[(DT- 


-TF)x 1.585 1 ]- 1} x 10 ul 


303 


{[(DT- 


-TF) x 1.585 2 ] - 1} xiOul 


304 


{[(DT- 


-TF) x 1.585 3 ] - 1} x 10 ul 


Table 4: Narrow Range Volume Calculation 


Pre-Dilution 
Tube 


Volume 


301 


{[(DT- 


-TF)x 1.19°] -1} x30 ul 


302 


{[(DT- 


-TF) x 1.19 1 ] - 1} x 10 ul 


303 


{[(DT- 


-TF) x 1.19 2 ] - 1} x 10 ul 


304 


{[(DT- 


-TF) x 1.19 3 ] - 1} x 10 ul 



[0045] After the storage buffer has been loaded into pre-dilution tubes 301 -304, 

the storage buffer is transferred to dilution tubes 305-316. If a wide range assay 
has been selected, 53 microliters of the storage buffer are loaded into dilution 
tubes 305-308, 388 ul are loaded into dilution tubes 309-312; and 2,504 ul are 
loaded into dilution tube 313-316. If, however, a narrow range assay has been 
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selected, 20 ul are loaded into dilution tubes 305-308, 60 ul are loaded into 
dilution tubes 309-312; and 141 ul are loaded into dilution tubes 313-316. 

[0046] After the storage buffer has been distributed, the operator prepares the 

initial four dilutions of the restriction enzyme test sample and distributes the test 
sample into pre-dilution tubes 301-304. The amount transferred depends on the 
assay type that has been selected. For a wide range assay, 30 ul of the test sample 
are transferred to tube 301 ; and 1 0 ul are transferred to each of the tubes 302-304. 
For a narrow range assay, 20 ul are transferred to tube 301 ; 15 ul are transferred 
to each of tubes 302-303; and 10 ul are transferred to tube 304. 

[0047] After the restriction enzyme test sample has been distributed among pre- 

dilution tubes 301-304, the operator prepares dilution tubes 305-316 for multiple 
non-serial dilutions. The amount distributed depends on the assay type. For wide 
range assays, 10 ul are transferred from tube 301 to dilution tubes 305, 309 and 
313; 10 ul are transferred from tube 302 to dilution tubes 306, 310 and 314; 10 
ul are transferred from tube 303 to dilution matrix tubes 307, 3 1 1 and 315; and 
10 ul are transferred from tube 304 to dilution tubes 308, 312 and 316. For 
narrow range assays, 20 ul are transferred from tube 301 to dilution tubes 305, 
309 and 313; 20 ul are transferred from tube 302 to dilution tubes 306, 310 and 
314; 20 ul are transferred from tube 303 to dilution tubes 307, 3 1 1 and 315; and 
20 ul are transferred from tube 304 to dilution tubes 308, 312 and 316. 

[0048] Each pre-dilution tube 301-304 and dilution tube 305-316 has a 

corresponding reaction tube (not shown). Likewise, each control sample (i.e., 
early, mid and late-stage consistency samples and an un-cut DNA substrate) has 
a corresponding control tube (not shown). An operator loads each reaction tube 
(not shown) and control tube (not shown) with 120 ul of DNA substrate mix. The 
reaction tubes are placed in a water bath set for a controlled temperature. The 
operator also initiates the reactions one at a time by removing each reaction tube 
(not shown) from the water bath to load the reaction tube with 5 ul of enzyme 
concentration from a corresponding pre-dilution tube 301-304 and dilution tube 
305-316. Similarly, the operator loads each control tube (not shown) with a 
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corresponding control sample. As each reaction is initiated, the operator returns 
the reaction tube (not shown) to the water bath set at the controlled temperature 
for a specified incubation time period. At the end of the specified incubation 
period, the reactions are stopped in the same order as they have been initiated. 
When the reactions are stopped, the restriction enzyme has cleaved the DNA 
substrate into various fragments of digestion. 

[0049] Referring back to FIG. 2, after test samples of the DNA substrate mix and 

diluted enzyme concentration are prepared at step 208, the control flow passes to 
step 212 where test aliquots of the solution are distributed to reaction wells 1 04a- 
104p of separation medium 102 for electrophoresis. The test aliquots are 
distributed by apportioning the DNA substrate mix (including the appropriate 
reaction buffers and diluted enzyme samples) from the reaction tubes (not shown) 
to corresponding reaction wells 104a-104p. The control samples are likewise 
transferred to a designated control well. 

[0050] It should be noted, however, that in an embodiment, the control samples 

are provided for visual analysis and comparison. The control samples are used to 
detect, but not quantify, significant errors related to a final digestion reaction. 
Thus, the control samples are used to develop a confidence level that a reported 
value, as determined below, is valid. In another embodiment, the same enzyme 
concentration is run as a control sample. In this embodiment, reaction wells 1 04a- 
104p are modified to include 20 lanes. As values are generated, as discussed 
below, processor 110 warns the operator if the values are outside an expected 
range. 

[005 1 ] FIG. 4 illustrates an embodiment of separation medium 1 02 that includes 

sixteen reaction wells 1 04a- 1 04p and four control wells 40 1 -404. Reaction wells 
104a-104p are used to resolve the DNA samples. The four control wells 401-404 
are used to evaluate the un-cut DNA substrate and the early, mid and late-stage 
digestion patterns, as discussed above. 

[0052] Referring back to FIG. 2, at step 216, the gel is run to initiate the 

electrophoresis in reaction wells 104a-104p and control wells 401-404. In this 



embodiment, an agarose gel is used to separate the DNA sample. To improve 
resolution for the subsequent imaging process (discussed below), the volume of 
the gel should be 60 milliliters (ml), and the amount of buffer solution in the 
electrophoresis apparatus should be 850 ml. 

Referring back to FIG. 4, separation medium 102 shows the DNA 
substrate fragments as they line up according to molecular weight in the 
respective lane located below the respective reaction wells 1 04a- 1 04p. Likewise, 
fragments from the control samples also line up below the respective control 
wells 40 1 -404. The conditions for separating the DNA substrate fragments should 
be optimized to obtain optimally straight and narrow final banding patterns to 
avoid a wavy pattern or smile effect that often occurs in gel electrophoresis. 
Nonetheless, as discussed below, the methods and system of the present invention 
are designed to overcome the problems due to waviness. 

m. Image Capturing 

Referring again to FIG. 2, in step 220, the operator uses image capturing 
device 108 to produce an image of the DNA fragments residing in the lanes 
below reaction wells 1 04a- 1 04p of separation medium 1 02 . Prior to capturing the 
gel images in an embodiment, the resolved DNA fragments are stained with a 
reporter molecule, such as the SYBR® or SYPRO® fluorescent reagents 
available from Molecular Probes, Inc., including without limitation SYBR® 
Gold, SYBR® Green, SYPRO® Ruby, and the like. The SYBR® Gold 
flourescent reagent, for example, has high fluorescence yield on binding, and 
does not require de-staining prior to being imaged. Other fluorescent regents, 
such as ethidium bromide, can be used. However, ethidium bromide must be de- 
stained to reduce the fluorescent from interfering with the subsequent intensity 
measurements from the gel images. For quantitation of proteins, similar use of a 
fluorescent stain such as the SYPRO® fluorescent reagents (e.g., SYPRO® 
Ruby) available from Molecular Probes, Inc. can be used to detect and quantify 
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the relative concentrations of each protein fragment after electrophoretic 
resolution. A non-fluorescent absorption-mode stain such as Commassie Blue R- 
250 can also be used to detect and quantify a protein fragment, but would require 
destaining. 

[0055] Following the staining and de-staining (if required) process, high spatial 

(approximately 0.1 mm) and intensity resolution (i.e., 12 bit minimum) gel 
images are collected. As discussed, in an embodiment, the images are obtained 
using SYBR® Gold staining with data collected by a Digital Science® Image 
Station 440CF running KDS1.0D software available from Eastman Kodak 
Corporation. The Image Station 440CF is capable of digitizing the images at one- 
minute intervals until a maximum signal greater than 4,000 is reached. To 
improve the reliability of the subsequent intensity measurements, the images 
should be free of visual artifacts, such as, large smears, gel tears, strong row 
curvature (e.g., waviness patterns), fading of bands toward one side of gel, or 
extreme band deformation. 

[0056] As described above in reference to FIG. 1 in an embodiment, image 

capturing device 1 08 includes a custom optical filter element that is configured 
to improve the raw signal quality of the image. For SYBR® dye- fluorescence 
signal quantitation, the filter should be designed to reject short wavelength 
excitation wavelengths below 470 nm entirely as well as longer wavelengths 
above 550 nm. The filters should be able to substantially transmit SYBR® dye 
emitted fluorescence between 495 nm to 530 nm, irrespective of angle of 
incidence, and without introduction of optical distortion. If another reporter 
molecule is used, the optical filter must be adjusted to match the excitation and 
emission characteristics of that particular dye. 

[0057] As discussed, a fluorescent stain such as SYPRO® Ruby, or a non- 

fluorescent absorption-mode stain such as Commassie Blue R-250 can be used 
to quantify the concentration of a protein fragment after electrophoretic 
resolution. The filter system would be adjusted to the properties of each of these 
dyes as specified by the respective manufacturer. 



[0058] At step 224, image capturing device 1 08 processes the images to extract 

the intensity data profile for each lane of DNA fragments. The profile includes 
a set of horizontally summed pixels scanned from top to bottom within a lane. In 
an embodiment, image capturing device 108 uses Digital Science® ID Image 
Analysis Software Version 3.0. available from Eastman Kodak Corporation to 
obtain the intensity data profiles. The software scans each lane on the gel images 
to determine the intensity measurements. It should be understood that any 
combination of molecular reporter, and hardware and software capable of 
digitizing fragment size distributions with comparable fidelity would be 
consistent with the data analysis and extraction operations of the present 
invention. After the intensity data profiles have been extracted, the control flow 
ends as indicated by step 295. 

IV. Unit Activity Projection 

[0059] FIG. 5 illustrates a general operational flow of an embodiment of the 

present invention. More specifically, flowchart 500 shows an example of a 
process for analyzing an image of bands of partial and full DNA fragments to 
quantify endonuclease activity. In an embodiment, a Microsoft® Excel worksheet 
is used to quantify the activity. However, other data processing software or 
programming languages can be used to implement the methods of the present 
invention, as would be apparent to one skilled in the relevant art(s). 

[0060] FIG. 5 begins at step 501 and passes immediately to step 504, where the 

requisite initial parameters are entered by the operator or loaded from a storage 
medium for use during subsequent calculations. The parameters include various 
threshold values used to analyze unit activity, a molecular weight ratio, a 
calibration factor, or the like, as discussed in detail below. 

[0061 ] Additionally at step 504, the operator enters, or system 1 00 loads, a lane- 

to-lane dilution factor which corresponds to the type of assay selected (i.e., wide 
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or narrow range assay). For a wide range assay, the lane-to-lane dilution factor is 
1.585, and for a narrow range assay, the lane-to-lane dilution factor is 1.19. 

[0062] Another parameter loaded or entered at step 504 is a dilution target. As 

described in reference to step 208 in FIG. 2, the dilution factor is based on the 
lane-to-lane dilution factor for the selected assay type. The dilution target serves 
as an estimate of endonuclease activity. However in combination, the dilution 
target and the lane-to-lane dilution factor uniquely and completely describe a set 
of physical dilutions created by manipulating a sample through a dilution matrix, 
such as the dilution matrix illustrated in FIG. 3. Thus, these parameters enable 
processor 110 to properly interpret a raw image profile set to determine the 
amount of dilution applied to the samples that generated the distinct banding 
pattern for each lane profile. 

[0063] Processor 110 would use the dilution target and lane-to-lane dilution 

factor to produce an estimated guess of unit activity by calculating an effective 
dilution target representing 66-75% complete digestion. First, a lane-specific 
physical dilution factor is calculated as: 

Physical Dilution Factor (Lane) = (DT * LL (Lane " ,} ) * (TV/SQ) 

where "Lane" identifies the specific lane number; and DT, LL, TV and SQ are the 
values discussed in reference to step 208 for the dilution target, lane-to-lane 
dilution factor; transfer volume, and substrate DNA quantity, respectively. The 
physical dilution factor must be adjusted to account for the volume of reaction 
mixture containing the DNA substrate. In an embodiment, 5 ul of sample is 
transferred into 120 ul of reaction mixture containing 2.5 ug DNA substrate. 
Based on empirical observations, the physical dilution factor in this embodiment 
must be reduced by a factor to two to produce a more accurate measure of the test 
sample unit strength. As such, the lane-specific effective dilution factor is 
calculated as: 



-20- 



Effective Dilution Factor (Lane) = Physical Dilution Factor (Lane) - 2 

where the value "2" is the unit strength adjustment factor. As would be apparent 
to one skilled in the relevant art(s), the unit strength adjustment factor would vary 
according to the reaction volume, substrate quantity, transfer volume or the like. 
As such, effective dilution factors representing 66-75% completion are calculated 
to derive a projection of a unit call (herein referred to as a "reported unit call") 
representing complete digestion as per unit definition. 
[0064] Referring back to step 504, another parameter loaded or entered is a 

threshold crossing level. The threshold crossing level is assigned a value where 
the bands of DNA fragments continue to provide a strong, robust intensity signal, 
but is set sufficiently above background noise levels to assure an accurate signal 
quantitation. For instance, for a narrow range assay, the threshold crossing should 
be between the second and third intensity data points that still have a clear signal. 
However, the threshold crossing for a narrow range assay can be set at any point 
above an area having at least two noise data points above the background noise 
level, or having at least two data points above a region where the operator has a 
high degree of confidence. 
[0065] Also, entered or loaded at step 504 is a historical unit assay value for the 

restriction endonuclease that is based on empirical observations, industry 
standards or vendor specifications. In short, the historical unit assay value states 
the restriction enzyme unit definitions at which point customers can expect to 
attain complete digestion of substrate DNA under a set of prescribed conditions. 
In an embodiment, the historical unit assay value represents the amount of 
enzyme required to digest 1 ug of the appropriate substrate DNA completely in 
sixty minutes under the conditions specified for the enzyme. 
[0066] At step 5 08 , the intensity data profiles are received from a memory source 

or image directory. The intensity data profiles can be retrieved from an input file 
stored in a temporary or permanent memory location on a hard disk drive or 
removable storage device, such as a floppy diskette, magnetic tape, optical disks, 
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or the like, and loaded into processor 110. In addition to the intensity data, the 
input file would identify the type of enzyme that has been assayed. An output file 
can also be created to store relevant test results, as described below in further 
detail. 

[0067] At step 512, processor 110 reads the intensity measurements (e.g., pixel 

values) from the intensity data profiles into a data structure for further processing. 
As discussed, each intensity data profile represents chromatographic data derived 
from a distinct lane within the digital image that is produced from image 
capturing device 108. In an embodiment, processor 110 adjusts or corrects the 
intensity data profiles to remove background intensity and/or scattering effects. 
This process helps to reduce any interference caused by the fluorescent reagent 
or the gel support matrix used to stain the DNA fragments. In an embodiment, the 
correction process is implemented by subtracting an offset from each pixel. The 
offset corresponds roughly to the background fluorescence and excitation source 
scattered breakthrough. The offset is computationally equivalent to the lowest 
pixel in each intensity data profile. 

[0068] The background corrected intensity data profiles are copied to a second 

data structure. With the second data structure, processor 110 inserts linearly 
interpolated points between each raw data point in the original set of intensity 
data profiles. Processor 110 uses this technique to mitigate any problems arising 
from the DNA bands not being straight or the DNA bands producing a wavy 
pattern in the digital image. 

[0069] Upon completion of the point-by-point linear insertions, peaks are 

detected within each set of pixel data from the modified intensity data profiles. 
In other words, each lane (as represented by a respective intensity data profile) 
would contain one or more peaks that signal higher levels of intensity 
measurements. Each peak represents, for example, a DNA substrate fragment 
(i.e., DNA band). 

[0070] In an embodiment, processor 110 traces the pixels point-by-point to detect 

the peaks. A state-engine based derivative tracking system is used to detect the 
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changes in trace direction that occur at peaks and troughs in a pixel data set. The 
state-engine is governed by logical limitations that prevent frivolous response to 
tiny fluctuations in the pixel data set. The first limit is based on hysteresis or 
minimum change. This limit is set as a fraction of the maximum pixel intensity 
within a given intensity data profile. Therefore, the limit adjusts to intensity 
variations from one intensity data profile to the next. 
[0071] The second logical limitation is a peak rejection interval which is a 

predetermined number of pixel data points. This limitation is based on an 
assumption that the peaks of interest will be spaced no closer than a certain 
number of pixel data points. Therefore, the state-engine would reject peaks or 
troughs that occur at intervals more closely spaced than the peak rejection 
interval. This limitation is empirically determined based on the resolution of 
image capturing device 108, zoom optics, and quality of the electrophoretic 
separation. In an embodiment, the peak rejection interval is set to two pixel data 
points. 

[0072] Once a listing of peaks is obtained for each intensity data profile, the 

peaks are sorted and aligned to determine the peaks that most likely correspond 
to each other from profile to profile (i.e., referring to the captured image, from 
lane to lane). This allows the DNA bands to be correctly associated with the 
proper lane (i.e., from the sixteen reaction wells 104a-104p). This is implemented 
by processor 110 selecting an intensity data profile (and, hence, a lane), and 
matching the peaks in the selected profile with the most likely associated peaks 
in the other profiles. 

[0073] This is performed as a three step process. Processor 1 1 0 would select the 

intensity data profile corresponding with, for example, lane 10. Based on 
empirical data, the lane-to-lane band pixel offsets in high-quality images rarely 
exceed plus or minus seven full pixels. Thus, in the first step, a list of pixel 
positions in lane 10 is compared to the pixel positions in lane 9. A scanning 
process locates the closest peaks in lane 9 within seven pixels of each peak in 
lane 10. If one peak in lane 9 is assigned to two or more peaks in lane 10, the 
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assigned peaks are scanned for consistency relative to other neighboring peak 
offsets, and one peak is chosen. The other peaks are de-assigned. 

[0074] The process continues comparing lane 9 to lane 8, lane 8 to lane7, and so 

forth until lane 1 is processed. A similar process compares lane 10 with lane 1 1 , 
lane 1 1 to lane 12, and so forth until lane 16 is encountered. 

[0075] Upon completion, processor 110 establishes a table to associate each peak 

to a neighboring lane. The differences are calculated and also stored in the table 
as peak offset values. The peak offset values are scanned for image-wide 
relationships. During this process, processor 110 searches for peak systems that 
continue as far as possible from lane 10 to lane 1 and to lane 16. Those peaks that 
form complete mountain-line ranges of associated peaks are identified and copied 
to a third data structure. 

[0076] The offsets between neighboring peaks are refined by application of an 

iterative comparison process. For each lane, a set of nine pixels having a centered 
peak pixel is compared to a set of similar sets of nine pixels from a flanking lane 
centered about the peak offset values discussed above. This is implemented by 
using fractional pixel steps made possible from the continuous linearly 
interpolated values derived from the original intensity data profiles. During the 
iterative comparison, processor 110 calculates the summed square difference 
between the nine pixels from each lane. Each multiple pixel sum square 
difference is compared to the previous best sum with lower values considered to 
be closer matches between the two lanes snippets. The fractional offset that yields 
the lowest score is taken to be the refined offset between two peaks, and is stored 
in the data structure associated with that pair of peaks. This process continues 
down each lane and across from lane 10 to lane 1 and lane 10 to lane 16. For each 
peak in each lane, an integrated refined offset is then calculated by tracing along 
the association path. This value is stored in a data table associated with the peak 
itself. 

[0077] The integrated refined offsets are taken to be guideposts in an intensity 

data profile that indicate points of equivalence among multiple intensity data 
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profiles. A function is defined to return the intensity of pixels in a lane in terms 
of a reference frame based on the pixels in lane 1 0. In other words if one requests 
a corrected pixel 27 in lane 4, the computation calculates the actual position in 
lane 4 that would correspond to pixel 27 in lane 10. This calculation includes a 
routine that expands and/or contracts segments of any given lane other than lane 
10. This is accomplished by linear interpolation in a piecewise fashion where the 
peaks in lane 4 corresponding to peaks flanking pixel 27 in lane 10 are used as 
guideposts and indeed endpoints for the interpolation process. For each pixel 
within this flanking region, an offset is calculated as the linearly interpolated 
value of the offset at the endpoints and the returned actual pixel position in lane 
4 (which is likely fractional) is returned. This actual pixel position corresponding 
to pixel 27 in lane 10 is retrieved from the lane 4 data set with fractional values 
returned by linear interpolation in all cases. Thus, the expansion / contraction is 
accomplished through use of associated refined guidepost peak positions to 
generate linearly interpolated offset indexes that are used as direct fractional-pixel 
queries into the original linearly interpolated data set through the lens of the 
defined expansion/contraction function. 

[0078] In an embodiment, no accommodation is made for an apparent gain or 

loss of original signal intensity that occurs on expansion and contraction. It has 
been observed that the correction methodologies or techniques of the present 
invention generally are consistent in a local sense and lead to minute 
discrepancies. To increase the accuracy in intensity integration, the returned pixel 
intensity for a given location is adjusted by a degree of compression at that pixel. 
This is defined as the change in offset from an upper to a lower guidepost peak 
divided by the number of pixels between the peaks in the original peak data set 
and would be a number very close to unity in most cases. 

[0079] To present an aligned set of pixels, processor 110 scans through the data 

sets for each lane using the expansion/contraction function and displays the data 
as lists using the charting functions of a graphics application, such as the 
Microsoft ® Excel application available from Microsoft Corporation (Redmond, 
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WA). The display is created in half pixel increments for clarity and presentation 
purposes. The profiles are given a fixed offset for display only that allows them 
to be displayed one on top of the next thus making the trends in the data set 
intuitive and easy to see. 

[0080] FIG. 6 shows this series of stacked lane profiles that are displayed on 

display 1 1 2. As can be seen, the peaks from the profiles are vertically aligned. In 
other words, the related DNA bands (represented by the peaks) in the stacked lane 
profiles share the same region along the horizontal axis of a graph. 

[0081] Referring back to FIG. 5, at step 5 1 6, a set of partial and final DNA bands 

are selected for analysis. In an embodiment, processor 110 selects the partial and 
final bands based on preset criteria. In another embodiment, the operator 
designates the locations of these two bands of interest by using an input device 
(e.g., mouse) to position graphical brackets on either side of the desired peaked- 
stacked sets. Referring again to FIG. 6, the peaks or DNA bands selected as the 
partial bands are designated with a dotted bracket, and the peaks or DNA bands 
selected for the final bands are designated with a solid bracket. The edges of the 
brackets define the background for the peak measurements. 

[0082] Accordingly, the operator positions brackets around a pre-defined set of 

peaks which must include one or more partial band peak associations and one or 
more final band associations. The bracket positions are used to define initial 
limits of integration for each peak within associated peak sets. To be more 
specific, the initial limits of integration for each peak are refined iteratively by 
scanning two pixels on either side of both the lower and upper initial limits 
searching for the refined limits that return the largest integrated sum. The reason 
for this final refinement is seated in both relieving the operator of finding an 
exactly optimum bracket position and in allowing for any small residual 
alignment drift that may be present across the full 16 lane span of the data set. 
The resulting integrated peak intensities for the partial and final bands are stored. 

[0083] Referring back to FIG. 5, at step 520, processor 110 evaluates the 

bracketed regions to extract a set of intensity ratios. Each intensity ratio relates 
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the intensity value of the partial band to the intensity value of the final band. The 
intensity values for the bands are computed from the local background corrected 
integrated peak intensities from bracketed regions. Accordingly, an intensity ratio 
is computed for each lane from the gel image. The calculated intensity ratios are 
estimates of the extent to which the DNA substrate has been digested by the 
enzyme concentration in each reaction well 104a-104p. 
[0084] Since the intensity of a DNA band is proportionally related to its 

molecular weight, the intensities ratios can also be normalized by taking the 
product of the integrated intensity values and the inverse of their respective 
molecular weights, or: 

Normal Intensity Ratio = [(Ip/L-) x (MW/MW p )] 

where Ip and MW p represent the integrated intensity value and molecular weight, 
respectively, for a partial band, and I f and MW f represent the integrated intensity 
value and molecular weight for a final band. 
[0085] During the peak integrations described to compute the intensity ratios, 

processor 110 is operable to adjust or correct the intensity data profiles by 
removing outliers or other faulty data. In an embodiment, processor 110 
highlights intensity data points with a "1" to signal questionable data points. The 
"1" indicates the reported unit activity is strongly influenced by the associated 
data point. 

[0086] In an embodiment, a data point is highlighted with a "1 "if its exclusion 

changes the result by more than 25% for a wide range assay, or by more than 1 0% 
for a narrow range assay. Processor 110 also highlights data points with a "2" if 
they are judged to be located at an unusual distance from a predetermined range 
but does not change the final unit result. Type-2 highlights draw attention to 
minor potential anomalies in the intensity data profile that when properly 
reviewed can lead to an improvement in result certainty. The operator can choose 
to exclude the type-1 or type-2 data points by changing the designation to a "0." 
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Once the data point is excluded, processor 110 refits the intensity data to the 
model. 

[0087] Again, an empirical assumption is made for the final process of the data. 

That assumption is that the normalized band intensity ratio should decrease 
monotonically in the region where the unit value for the enzyme concentration 
had been historically assigned. That is, one must know that based on statistical 
modeling of the restriction reaction, it is not physically feasible to predict an 
abrupt increase in the normalized band intensity ratio with decreasing sample 
dilution. However, such abrupt increases are from time to time noted in practice. 
These abrupt increases are attributable to physical limitations of the experimental 
implementation including pipette inaccuracy, operator error, and simple mixing 
problems. In fact, the degree to which a series of normalized band intensity ratios 
smoothly define a transition is a strong indication of validity in the final result. 
This observation is a matter of intrinsic design for the present invention starting 
with the use of an interleaved, expanded tube number dilution matrix having 
equally tempered geometric lane-to-lane dilution spacing. 

[0088] A unique, robust and efficient method has been developed to take 

advantage of these observations. The method is based on finding the lowest score 
configuration of a multi-parameter coupled-body mathematical model. Each body 
is initially placed on top of each point on the raw normalized band intensity ratios 
plotted on a double logarithmic chart versus the effective dilution factors. A 
higher score is assigned to a body if the change in the vector direction from that 
body to its neighbors differs from the same change of those neighbors. That is, 
lower scores are assigned to smoothly curving configurations of points. This is 
referred to as the curvature strain. The sum of these squared curvature changes 
is calculated. A similar sum is made of the two-dimensional square differences 
for each of the bodies from the original data set points. The curvature strain term 
is weighted and summed with the distance sum. This forms the score of any given 
coupled body configuration. 
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[0089] An optimization model is constructed to find the most optimal 

configuration of the coupled bodies based on minimizing the above score 
function. The optimization model executes an implementation of non-linear 
optimization, so that each intensity axis value is changed by the optimization 
model for the coupled-bodies until the smallest score is obtained. In an 
embodiment, the optimization model is based on the Excel Solver function from 
the Microsoft® Excel application available from Microsoft Corporation. 
However, other software applications can be built or used as would be apparent 
to one skilled in the relevant art(s). 

[0090] At step 524, the normalized intensity ratios are used to determine a 

threshold crossing value. Generally, the normalized intensity ratios approach 
lower limits as the dosage of enzyme yields complete digestion of the DNA 
substrate. When a calculated trend from these intensity ratios approaches a 
threshold crossing level, the corresponding threshold crossing value provides a 
measure of dilution. The threshold crossing level is one of the initial parameters 
entered at step 504. 

[0091] The intensity ratio trend is calculated from the final configuration of 

coupled-bodies from the optimization model. The final configuration represents 
the very-likely reaction trajectory described by the underlying raw normalized 
band intensity ratio values. Thus, processor 1 10 produces a linearly-interpolated 
line which is calculated to connect these bodies. The intersection of this line with 
a line set at the threshold crossing level defines the threshold crossing value (in 
terms of effective dilution factor). If a data point is located far away from the final 
coupled body configuration, the data point is highlighted, as described above, as 
a possible point in error for further operator review. 

[0092] FIG. 7 illustrates a partial profile display according to an embodiment of 

the present invention. FIG. 7 shows a trend produced from a set of intensity 
ratios. The trend is plotted on a log scale of intensity ratios versus effective 
dilution factors. In this example, the threshold crossing level, from step 504, is 
designated as 0.01. As discussed, the threshold crossing level is selected at a 
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point above the noise region, but where a strong, robust intensity signal can be 
quantified. The trend approaches the threshold crossing level at apoint designated 
as the threshold crossing value. In this example, the threshold crossing value is 
approximately "1 8.9." 

[0093] After the trend has been approximated and the threshold crossing value 

has been computed, the control flow passes to step 528. At step 528, a calibration 
factor is computed from the threshold crossing value. As discussed, the threshold 
crossing value is a derived value dependent on effective dilution factors, and an 
effective dilution factor does not represent the complete digestion of the DNA 
substrate. Rather, the effective dilution factor is a measure of a partial digestion 
of the DNA substrate at some point prior to its complete digestion, usually 
between 66 to 75% complete. 

[0094] To calculate the calibration factor, the threshold crossing value is divided 

by the historical unit assay value entered at step 504 above. The historical unit 
assay value is used to convert the threshold crossing value into a historically 
expected unit by assay of enzyme products having well accepted historical unit 
strengths. Once the calibration factor has been established under known, 
controlled conditions, the calibration factor is fixed and remains unchanged 
during assay of unknown samples. 

[0095] At step 532, the reported unit value is calculated by dividing the threshold 

crossing value by the calibration factor from step 528. Accordingly, the 
calibration factor is used to roughly calibrate the system and method of the 
present invention with unit activity measurements that are historically accepted 
by the restriction endonuclease industry. As a result, the present invention 
maximizes product consistency. Moreover, the methods and systems of the 
present invention allow unit activity to be determined objectively from intensity 
measurements. The reported unit call is an objective estimation of reaction 
completion that is not based on conventional methods of visually detecting the 
vanishing of band signal intensity. 
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[0096] In an embodiment, partial-to-final band ratios, threshold crossing values, 

unit calls, other results and/or other context information are stored in an output 
file or a database (not shown) for historical reference. Additionally, in an 
embodiment, the range of certainty in the reported result(s) is also 
computationally estimated using a Monte-Carlo technique. Thirty permutations 
of the sixteen data point x and y axis in the original data set are created. The 
permutations are created using normally distributed random numbers centered 
around zero and having standard deviations based on known inaccuracies in the 
assay design. For each of these thirty permutations, the resulting unit values are 
obtained and stored. Finally, a standard deviation calculation is applied to the unit 
values to obtain an estimate of the stability of the raw data set configuration. 

[0097] A similar sort of operation is also applied to sixteen further permutations 

of the raw data set through point-by-point deletion. Again, a standard deviation 
calculation is applied to the resulting list of unit values. Together, the statistically 
combined standard deviations are combined, multiplied by two, and reported as 
the range of error for the originally reported unit value. Processor 110 would also 
set a flag for further operator review for any raw data point whose removal would 
causes a change of greater than a certain limit in the reported unit value. This 
prevents the present invention from reporting unit values that are supersensitive 
to a single data point. 

V. Software and Hardware Embodiments 

[0098] As is apparent from the foregoing description, the present invention was 

developed primarily to measure enzyme unit activity for digesting a DNA, RNA, 
protein or peptide substrate. However, it would be clear to persons skilled in the 
relevant art(s) that the analysis techniques of the present invention have utility 
beyond the endonuclease activity described above. The invention has application 
outside this area and can be used generally to analyze and measure any type of 
titrations or biochemical activities. 
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[0099] The present invention (e.g., system 100, image capturing device 108, 

processor 110, display 112, or any part thereof) can be implemented using 
hardware, software or a combination thereof and can be implemented in one or 
more computer systems or other processing systems. In fact, in an embodiment, 
the invention is directed toward one or more computer systems capable of 
carrying out the functionality described herein. 

[0100] Referring to FIG. 8, an example computer system 800 useful in 

implementing the present invention is shown. The computer system 800 includes 
one or more processors, such as processor 804. The processor 804 is connected 
to a communication infrastructure 806 (e.g., a communications bus, cross-over 
bar, or network). Various software embodiments are described in terms of this 
exemplary computer system. After reading this description, it will become 
apparent to a person skilled in the relevant art(s) how to implement the invention 
using other computer systems and/or computer architectures. 

[0101] Computer system 800 can include a display interface 802 that forwards 

graphics, text, and other data from the communication infrastructure 806 (or from 
a frame buffer not shown) for display on the display unit 830. 

[0102] Computer system 800 also includes a main memory 808, preferably 

random access memory (RAM), and can also include a secondary memory 810. 
The secondary memory 810 can include, for example, a hard disk drive 812 
and/or a removable storage drive 814, representing a floppy disk drive, a 
magnetic tape drive, an optical disk drive, etc. The removable storage drive 814 
reads from and/or writes to a removable storage unit 818 in a well-known 
manner. Removable storage unit 818, represents a floppy disk, magnetic tape, 
optical disk, etc. which is read by and written to removable storage drive 8 14. As 
will be appreciated, the removable storage unit 818 includes a computer usable 
storage medium having stored therein computer software and/or data. 

[0103] In alternative embodiments, secondary memory 810 can include other 

similar means for allowing computer programs or other instructions to be loaded 
into computer system 800. Such means can include, for example, a removable 
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storage unit 822 and an interface 820. Examples of such can include a program 
cartridge and cartridge interface (such as that found in video game devices), a 
removable memory chip (such as an EPROM, or PROM) and associated socket, 
and other removable storage units 822 and interfaces 820 which allow software 
and data to be transferred from the removable storage unit 822 to computer 
system 800. 

[0104] Computer system 800 can also include a communications interface 824. 

Communications interface 824 allows software and data to be transferred between 
computer system 800 and external devices. Examples of communications 
interface 824 can include a modem, a network interface (such as an Ethernet 
card), a communications port, a PCMCIA slot and card, etc. Software and data 
transferred via communications interface 824 are in the form of signals 828 which 
can be electronic, electromagnetic, optical or other signals capable of being 
received by communications interface 824. These signals 828 are provided to 
communications interface 824 via a communications path (i.e., channel) 826. 
This channel 826 carries signals 828 and can be implemented using wire or cable, 
fiber optics, a phone line, a cellular phone link, an RF link and other 
communications channels. 

[01 05] In this document, the terms "computer program medium" and "computer 

usable medium" are used to generally refer to media such as removable storage 
drive 814, a hard disk installed in hard disk drive 812, and signals 828. These 
computer program products are means for providing software to computer system 
800. The invention is directed to such computer program products. 

[0106] Computer programs (also called computer control logic) are stored in 

main memory 808 and/or secondary memory 810. Computer programs can also 
be received via communications interface 824. Such computer programs, when 
executed, enable the computer system 800 to perform the features of the present 
invention as discussed herein. In particular, the computer programs, when 
executed, enable the processor 804 to perform the features of the present 
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invention. Accordingly, such computer programs represent controllers of the 
computer system 800. 
[01 07] In an embodiment where the invention is implemented using software, the 

software can be stored in a computer program product and loaded into computer 
system 800 using removable storage drive 814, hard drive 812 or communications 
interface 824. The control logic (software), when executed by the processor 804, 
causes the processor 804 to perform the functions of the invention as described 
herein. 

[0108] In another embodiment, the invention is implemented primarily in 

hardware using, for example, hardware components such as application specific 
integrated circuits (ASICs). Implementation of the hardware state machine so as 
to perform the functions described herein will be apparent to persons skilled in 
the relevant art(s). 

[0109] In yet another embodiment, the invention is implemented using a 

combination of both hardware and software. 

VI. Conclusion 

[0110] While various embodiments of the present invention have been described 

above, it should be understood that they have been presented by way of example, 
and not limitation. It will be apparent to persons skilled in the relevant art(s) that 
various changes in form and detail can be made therein without departing from 
the spirit and scope of the invention. Thus, the present invention should not be 
limited by any of the above described exemplary embodiments, but should be 
defined only in accordance with the following claims and their equivalents. 



